Avoiding distortions due to speech coding and transmission errors in GSM ASR tasks
نویسندگان
چکیده
In this paper, we have extended our previous research on a new approach to ASR in the GSM environment. Instead of recognizing from the decoded speech signal, our system works from the digital speech representation used by the GSM encoder. We have compared the performance of a conventional system and the one we propose on a speaker independent, isolateddigit ASR task. For the half and full-rate GSM codecs, from our results, we conclude that the proposed approach is much more effective in coping with the coding distortion and transmission errors. Furthermore, in clean speech conditions, our approach does not impoverish the recognition performance, even recognizing from GSM digital speech, in comparison with a conventional system working on unencoded speech.
منابع مشابه
Speech Recognition over Mobile Networks
This chapter addresses issues associated with automatic speech recognition (ASR) over mobile networks, and introduces several techniques for improving speech recognition performance. One of these issues is the performance degradation of ASR over mobile networks that results from distortions produced by speech coding algorithms employed in mobile communication systems, transmission errors occurr...
متن کاملTowards improving ASR robustness for PSN and GSM telephone applications
In real-life applications, errors in the speech recognition system are mainly due to inefficient detection of speech Ž . segments, unreliable rejection of Out-Of-Vocabulary OOV words, and insufficient account of noise and transmission channel effects. In this paper, we review a set of techniques developed at CNET in order to increase the robustness to mismatches between training and testing con...
متن کاملAnalysis and on-line detection of audible distortions in GSM telephony
Channel errors significantly impair the quality of GSM transmitted speech. In this contribution, we first analyze the speech distortions caused by error control failures or due to the basic frame substitution technique used by the GSM Full Rate. In particular, we show how this frame substitution procedure may introduce frame rate harmonics in the speech spectrum. Then we present algorithms that...
متن کاملRecognition from GSM digital speech
This paper addresses the problem of speech recognition in the GSM environment. In this context, new sources of distortion, such as transmission errors or speech coding itself, significantly degrade the performance of speech recognizers. While conventional approaches deal with these types of distortion after decoding speech, we propose to recognize from the digital speech representation of GSM. ...
متن کاملAutomatic Speech Recognition in GSM Network Using the Bit-Stream and Auxiliary parameters
The Global System for Mobile (GSM) environment includes three main problems for Automatic Speech Recognition (ASR) systems: noisy scenarios, source coding distortion and transmission errors.The second, source coding distortion must be explicitly addressed.The front-end of the speech recognition system combines feature extracted by converting the quantized spectral information of speech coder, p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999